Unipro UGENE NGS pipelines and components for variant calling, RNA-seq and ChIP-seq data analyses
Authors
Abstract
The advent of next-generation sequencing (NGS) technologies has opened new possibilities for researchers. However, as biology becomes a more data-intensive field, biologists increasingly have to learn how to process and analyze NGS data with complex computational tools. Even when common pipeline specifications are available, installing and configuring the pipeline tools is often a time-consuming and cumbersome task for a bench scientist. We believe that a unified, biologist-friendly desktop front end to NGS data analysis tools will substantially improve productivity in this field. Here we present the NGS pipelines "Variant Calling with SAMtools", "Tuxedo Pipeline for RNA-seq Data Analysis" and "Cistrome Pipeline for ChIP-seq Data Analysis", integrated into the Unipro UGENE desktop toolkit. We describe the available UGENE infrastructure that helps researchers run these pipelines on different datasets, store and investigate the results, and re-run the pipelines with the same parameters. These pipeline tools are included in the UGENE NGS package. Individual blocks of these pipelines are also available to expert users for building their own advanced workflows.
Journal title:
Volume 2, Issue
Pages -
Publication date: 2014